Performance Improvement Algorithms in Big Data Analysis
نویسندگان
چکیده
منابع مشابه
MapReduce Algorithms for Big Data Analysis
There is a growing trend of applications that should handle big data. However, analyzing big data is a very challenging problem today. For such applications, the MapReduce framework has recently attracted a lot of attention. Google’s MapReduce or its open-source equivalent Hadoop is a powerful tool for building such applications. In this tutorial, we will introduce the MapReduce framework based...
متن کاملBringing High Performance Computing to Big Data Algorithms
Many ideas of High Performance Computing are applicable to Big Data problems. The more so now, that hybrid, GPU computing gains traction in mainstream computing applications. This work discusses the differences between the High Performance Computing software stack and the Big Data software stack and then focuses on two popular computing workloads, the Alternating Least Squares algorithm and the...
متن کاملScaling Polygon Adjacency Algorithms to Big Data Geospatial Analysis
Adjacency and neighbor structures play an essential role in many spatial analytical tasks. The computation of adjacenecy structures is nontrivial and can form a significant processing bottleneck as the total number of observations increases. We quantify the performance of synthetic and real world binary, first-order, adjacency algorithms and offer a solution that leverages Python’s high perform...
متن کاملClassification Algorithms for Big Data Analysis, a Map Reduce Approach
Since many years ago, the scientific community is concerned about how to increase the accuracy of different classification methods, and major achievements have been made so far. Besides this issue, the increasing amount of data that is being generated every day by remote sensors raises more challenges to be overcome. In this work, a tool within the scope of InterIMAGE Cloud Platform (ICP), whic...
متن کاملEarly Adoption: High-Performance Computing for Big Data Introducing parallel programming and big data in the core algorithms curriculum
Proficiency in high-performance computing (HPC) is today an essential skill for students in any computer science program. While traditional curricula provide several courses related to parallel programming, we are increasingly including parallel computing topics in our mandatory undergraduate and graduate algorithms classes. We briefly review our recent activities in this direction and outline ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Procedia Computer Science
سال: 2020
ISSN: 1877-0509
DOI: 10.1016/j.procs.2020.11.040